Advanced Prosody Modelling
Abstract
A formal prosody model is proposed together with its application in a text-to-speech system. The model is based on a generative grammar of abstract, functionally motivated prosodic units. For each sentence, this grammar creates a tree structure of immediate prosodic constituents. A description function assigns a description vector to each prosodic word of the sentence, and a realization function uses this vector to create appropriate intonation for the prosodic word. The parameters of the model are set automatically from real speech data in a prosody corpus, which is also described.
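The pipeline in the abstract (tree of constituents, description function, realization function) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the `Node` type, the constituent labels, the choice of first/last-child flags as the description vector, and the fixed 10 Hz offsets. In the proposed model the parameters are set from a prosody corpus, not hard-coded.

```python
from __future__ import annotations
from dataclasses import dataclass, field


@dataclass
class Node:
    """One prosodic constituent; leaves stand for prosodic words (hypothetical type)."""
    label: str
    children: list[Node] = field(default_factory=list)
    text: str = ""  # filled only for prosodic-word leaves


def describe(root: Node) -> list[tuple[str, tuple]]:
    """Assumed description function: for each prosodic word, record at every
    tree level whether its ancestor is the first and/or last child."""
    out = []

    def walk(node: Node, path: list) -> None:
        if not node.children:
            out.append((node.text, tuple(path)))
            return
        n = len(node.children)
        for i, child in enumerate(node.children):
            walk(child, path + [(i == 0, i == n - 1)])

    walk(root, [])
    return out


def realize(vector: tuple, base_f0: float = 120.0) -> tuple[float, float]:
    """Assumed realization function: map a description vector to a
    (start, end) F0 target in Hz using made-up fixed offsets."""
    start = end = base_f0
    for is_first, is_last in vector:
        if is_first:
            start += 10.0  # raise the onset at the start of a constituent
        if is_last:
            end -= 10.0    # lower the offset at the end of a constituent
    return (start, end)


sentence = Node("sentence", [
    Node("phrase", [Node("word", text="the model"), Node("word", text="is based")]),
    Node("phrase", [Node("word", text="on a grammar")]),
])

contours = {word: realize(vec) for word, vec in describe(sentence)}
```

With these assumed offsets, the sentence-final word ends lowest, because it closes both its phrase and the sentence.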
Similar resources
MeLos: Analysis and Modelling of Speech Prosody and Speaking Style
This thesis addresses the issue of modelling speech prosody for speech synthesis, and presents MeLos, a complete system for the analysis and modelling of speech prosody, “the music of speech”. Research into the analysis and modelling of speech prosody has increased dramatically in recent decades, and speech prosody has emerged as a crucial concern for speech synthesis. The issue of speech prosod...
An Overview of Prosodic Modelling for Croatian Speech Synthesis
To include prosody in text-to-speech (TTS) systems, prosody knowledge needs to be acquired, represented, and incorporated. The two main features of prosody that matter for modelling prosody in TTS systems are duration and the F0 contour. There are various approaches to modelling those features, and they can be categorized into three main groups: rule-based, statistical, and minimalistic. Some...
Statistical Modelling of Speech Segment Duration by Constrained Tree Regression
This paper presents a new method for statistical modelling of prosody control in speech synthesis. The proposed method, referred to as Constrained Tree Regression (CTR), can suitably represent the complex effects of prosody control factors with a moderate amount of training data. It is based on recursive splits of predictor variable spaces and partial imposition of constra...
Improvements in Prosodic Processing for Speech Synthesis
For the synthesis of French, a separate modelling of fundamental frequency and timing seems more appropriate than derived modelling. Satisfactory declarative prosody in synthesis can be obtained with a version of Fujisaki F0 modelling that takes into account syllable amplitude differences as well as microprosody. Statistical methods based on both segmental and lexical input provide a satisfacto...
Stylization and Trajectory Modelling of Short and Long Term Speech Prosody Variations
In this paper, a unified trajectory model based on the stylization and the modelling of f0 variations simultaneously over various temporal domains is proposed. The syllable is used as the minimal temporal domain for the description of speech prosody, and short-term and long-term f0 variations are stylized and modelled simultaneously over various temporal domains. During the training, a context-...